PlantTFDB
Plant Transcription Factor Database
v4.0
Previous version: v3.0
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID Bostr.10064s0036.1.p
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Brassicales; Brassicaceae; Boechereae; Boechera
Family HD-ZIP
Protein Properties Length: 796aa    MW: 86834 Da    PI: 6.3187
Description HD-ZIP family protein
Gene Model
Gene Model ID Type Source Coding Sequence
Bostr.10064s0036.1.pgenomeJGIView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1Homeobox65.66.8e-21132187156
                           TT--SS--HHHHHHHHHHHHHSSS--HHHHHHHHHHCTS-HHHHHHHHHHHHHHHH CS
              Homeobox   1 rrkRttftkeqleeLeelFeknrypsaeereeLAkklgLterqVkvWFqNrRakek 56 
                           +++ +++t++q++eLe++F+++++p++++r eL+k+l L++rqVk+WFqNrR+++k
  Bostr.10064s0036.1.p 132 KKRYHRHTPQQIQELESMFKECPHPDEKQRLELSKRLCLETRQVKFWFQNRRTQMK 187
                           688999***********************************************999 PP

2START210.94.8e-663185371206
                           HHHHHHHHHHHHHHHHC-TT-EEEE....EXCCTTEEEEEEESSS...SCEEEEEEEECCSCHHHHHHHHHCCCGGCT-TT-S....EE CS
                 START   1 elaeeaaqelvkkalaeepgWvkss....esengdevlqkfeeskv..dsgealrasgvvdmvlallveellddkeqWdetla....ka 79 
                           ela++a++elvk+a +eep+Wvks     +++n+de++++f+++k     +ea+r+sg+v+ ++  lve+l+d++ +W+e+++    +a
  Bostr.10064s0036.1.p 318 ELALTAMDELVKLAHSEEPLWVKSLdgerDELNQDEYMRTFSSTKPtgLATEASRTSGMVIINSLALVETLMDSN-RWTEMFPcnvaRA 405
                           5899*************************************9999999***************************.************* PP

                           EEEEEECTT......EEEEEEEEXXTTXX-SSX.EEEEEEEEEEE.TTS-EEEEEEEEE-TTS--.-TTSEE-EESSEEEEEEEECTCE CS
                 START  80 etlevissg......galqlmvaelqalsplvp.RdfvfvRyirqlgagdwvivdvSvdseqkppesssvvRaellpSgiliepksngh 161
                           +t++ is g      galqlm+aelq+lsplvp R++ f+R+++q+ +g+w++vdvS+++ +++   s+v+R  +lpSg++++++sng+
  Bostr.10064s0036.1.p 406 TTTDIISGGmagtinGALQLMNAELQVLSPLVPvRNVNFLRFCKQHAEGVWAVVDVSIEPVRENAGGSPVIR--RLPSGCVVQDMSNGY 492
                           ****************************************************************99******..*************** PP

                           EEEEEEE-EE--SSXXHHHHHHHHHHHHHHHHHHHHHHTXXXXXX CS
                 START 162 skvtwvehvdlkgrlphwllrslvksglaegaktwvatlqrqcek 206
                           skvtwveh++++++++h+l+r+l++sgl +g ++w+atlqrqce+
  Bostr.10064s0036.1.p 493 SKVTWVEHAEYDENQIHQLYRPLLRSGLGFGSQRWLATLQRQCEC 537
                           *******************************************96 PP

Protein Features ? help Back to Top
3D Structure
Database Entry ID E-value Start End InterPro ID Description
SuperFamilySSF466891.67E-20119189IPR009057Homeodomain-like
Gene3DG3DSA:1.10.10.606.8E-22120189IPR009057Homeodomain-like
PROSITE profilePS5007117.297129189IPR001356Homeobox domain
SMARTSM003899.5E-18130193IPR001356Homeobox domain
CDDcd000861.49E-18131189No hitNo description
PfamPF000462.0E-18132187IPR001356Homeobox domain
PROSITE patternPS000270164187IPR017970Homeobox, conserved site
PROSITE profilePS5084842.074309540IPR002913START domain
CDDcd088757.65E-112313536No hitNo description
SuperFamilySSF559613.3E-29313537No hitNo description
SMARTSM002349.7E-39318537IPR002913START domain
PfamPF018522.1E-59318537IPR002913START domain
SuperFamilySSF559617.38E-21556780No hitNo description
Gene Ontology ? help Back to Top
GO Term GO Category GO Description
GO:0006355Biological Processregulation of transcription, DNA-templated
GO:0009827Biological Processplant-type cell wall modification
GO:0042335Biological Processcuticle development
GO:0043481Biological Processanthocyanin accumulation in tissues in response to UV light
GO:0048765Biological Processroot hair cell differentiation
GO:0008289Molecular Functionlipid binding
GO:0043565Molecular Functionsequence-specific DNA binding
Sequence ? help Back to Top
Protein Sequence    Length: 796 aa     Download sequence    Send to blast
MNFGSLFDNT PGGGSTGARL LSGLSYGNHT AATNNVLPGG AMAQAAGLFS PPLTKSVYAS  60
SGLSLALEQP ERRTNRGEAS MRKNNNNVGG DNFDGSVNRR SREDEHESRS GSDNVEGISG  120
EDQDAADKPP RKKRYHRHTP QQIQELESMF KECPHPDEKQ RLELSKRLCL ETRQVKFWFQ  180
NRRTQMKTQL ERHENALLRQ ENDKLRAENM SIREAMRNPI CTNCGGPAML GDVSLEEHHL  240
RIENARLKDE LDRVCNLTGK FLGHHHHHYN SSLELAVGTN NGGNFVFPPD FGGGACLPPP  300
QQQQTGINGI DQKSVLLELA LTAMDELVKL AHSEEPLWVK SLDGERDELN QDEYMRTFSS  360
TKPTGLATEA SRTSGMVIIN SLALVETLMD SNRWTEMFPC NVARATTTDI ISGGMAGTIN  420
GALQLMNAEL QVLSPLVPVR NVNFLRFCKQ HAEGVWAVVD VSIEPVRENA GGSPVIRRLP  480
SGCVVQDMSN GYSKVTWVEH AEYDENQIHQ LYRPLLRSGL GFGSQRWLAT LQRQCECLAV  540
LMSSSVTSHD NTSITPGGRK SMLKLAQRMT FNFCSGISAP SVHSWSKLTV GNVDPDVRVM  600
TRKSVDDPGE PPGIVLSAAT SVWLPASPQR LFDFLRNERI RCEWDILSNG GPMQEMAHIA  660
KGQDQGVSLL RSNAMNANQS SMLILQETCI DASGALVVYA PVDIPAMHVV MNGGDSSYVA  720
LLPSGFAVLP DGGLDGGSGD GEQRPVGGGS LLTVAFQILV NNLPTAKLTV ESVETVNNLI  780
SCTVQKIRAA LQCES*
Binding Motif ? help Back to Top
Motif ID Method Source Motif file
MP00421DAPTransfer from AT4G00730Download
Motif logo
Regulation -- PlantRegMap ? help Back to Top
Source Upstream Regulator Target Gene
PlantRegMapRetrieveRetrieve
Annotation -- Nucleotide ? help Back to Top
Source Hit ID E-value Description
GenBankAK2269680.0AK226968.1 Arabidopsis thaliana mRNA for homeodomain protein AHDP, complete cds, clone: RAFL09-36-B10.
Annotation -- Protein ? help Back to Top
Source Hit ID E-value Description
RefseqXP_006287095.10.0hypothetical protein CARUB_v10000257mg
SwissprotQ0WV120.0ANL2_ARATH; Homeobox-leucine zipper protein ANTHOCYANINLESS 2
TrEMBLR0H8W00.0R0H8W0_9BRAS; Uncharacterized protein
STRINGAT4G00730.10.0(Arabidopsis thaliana)
Orthologous Group ? help Back to Top
LineageOrthologous Group IDTaxa NumberGene Number
MalvidsOGEM112827105
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT4G00730.10.0HD-ZIP family protein